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ABSTRACT I 

Desirable properties of robotics vision database systems are given, and structures which possess 
properties appropriate for some aspects of such database systems are examined. Included in the 
structures discussed is a family of networks in which link membership is determined by measures of 
proximity between pairs of the entities stored in the database. This type of network is shown to 
have properties which guarantee that the search for a matching feature vector is monotonic. That is, 
the database can be searched with no backtracking, if there is a feature vector in the database which 
matches the feature vector of the external entity which is to be identified. The construction of the 
database is discussed, and the search procedure is presented. A section oo the support provided by 
the database for description of the decision-making processes and the search path is also included. 

I. Introduction / 

Several structures have beeo proposed which have properties desirable for use in a robotics vision database system. 
Some of these structures are examined in this paper, inducting a new family of networks in which link membership is 
determined by measures of proximity between pairs of entities represented by nodes in/the database and the triangle ine- 
quality. / 

Suitable domains for the database structures considered here are those for which the entities to be stored are 
describable by a few feature vectors, e.g. color, or shape using Fourier descriptors. We consider the following to be 
desirable properties for such a database: / 

1. The database system should support efficient search, so that the feature vector which 
provides the best match to some external entity can be found quickly; 

2. The structure should support classification, <$o that external entities can be named and 
higher levels of abstraction are supported;/ 

3. The structure should support a modest level of self-dcscnption, so that entities along 
a search path provide information abbut the template-matching and classification 
decisions being made; 

4. The neighbors of an entity should reflect consistency with respect to class, so that 
entities within a given mode of a class should be stored in a manner that reflects 
the associations and enhances retrieval; 

/ 

5. Learning, considered to be the addition of entities to the database system, should be 
done in such a way that the previous properties are preserved. 

While this list is too demanding to be well satisfied by any structure known to the authors, it is informative to 
explore the limitations of the various structures. The paradigm of preprocessing the data (entities to be represented in 
the database) so that search is facilitated is an important concept, selected by Dobkin and Lipton | 1 ], Bentley and Fried- 
man |2|, and Bentley and Maurer [31- Dobkin and Lipton |1| extended binary search to multidimensional search prob- 
lems, and could efficiently respond to queries which included the nearest-neighbor problem. Bentley and his colleagues 
specialized in range searching queries, in which it is desired to find all entries in the database in which each component 
of the feature vctJore is within some given range. 

The k-d trees and range trees discussed by Bentley and Friedman [2] arc interesting structures, designed specifi- 
cally for range queries. Range queries are, of course, extremely useful for a wide variety of applications, and can be 
used as a son of de facto classification scheme. Other papers presenting results of studies using k-d trees or range trees 
are Bentley and Maurer [3| and Chang and Fu |4|. 'Hiese methods suffer the limitations imposed by any hierarchical 
scheme in terms of descriptive power and classification strategies, since abstractions arc necessarily limited to those 
representable by hierarchies. Category information is, in fact, not a strong point of ihesc methods, since only hierarchi- 
cal neighbors of matching entities are readily available. 
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Kalvio, ct a!. [5], have developed a technique for pattern recognition based on geometric descriptions of the 
boundaries of objects. This technique is designed specifically for identification of overlapping and partially oeduded 
objects. Attributes used for matching arc derived from geometric features of segments of the boundaries of a set of 
objects. The search procedure is based on geometric hashing of objects in 5-space, where the coordinates are attribute 
values obtained during preprocessing of tbe data. A set of candidate matches (models) is selected on the basis of fre- 
quency of inclusion in the hypercubes of 5-spacc in which the unknown’s attributes place it. A match rate is computed 
for each candidate match (model) based on the ratio of the number of match points for the model and the number of 
possible match points for a particular unknown. The database organization provides for efficient search for the specific 
application for which it was intended; however, no categorical information is provided and varying levels of abstraction 
are not supported. 

Pathfinder networks share some attributes and objectives with memory-based reasoning, as discussed in Stanfill 
and Waltz |6]. Both paradigms make use of feature values to compute distance or dissimilarity functions, search 
memory for best match(es), and classify entities; and both paradigms share the philosophy of classifying entities by direct 
reference to memory. Pathfinder networks, however, arc organized algorithmically so that associations arc explicit, 
which results in categories being evident in the link structure. The search procedure described in Section III of this 
paper guarantees that search is monotonic (i.c., there is no backtracking) if there is a match in the datahasc. so that 
search is not exhaustive, as it is in the scheme used for the Connection Machine described in Stanfill and Waltz (6|. 
Furthermore, the memory-based reasoning paradigm does not support descriptions of the search path and the decisions 
made which contribute to classification. 

"Description" is used here to mean that the salient feature values of entities along the search path, the search path 
itself, the reasons for selecting the search path, and the neighborhood of the goal node, are available for a summary of 
the entire process. Section IV of this paper is devoted to a discussion of descriptive processes supported by the database 
developed here. 

The organization of the entities in the database is based on a network model of semantic memory in humans. 1 his 
model is called Pathfinder, and the properties of Pathfinder networks (PFNETs), previously called link-weighted net- 
works (LWNs), are described in Dcarholt, Schvaneveldt, and Durso [7 1 and in Schvancveldt, Dcarholt, and Durso |8|. 
Earlier work on databases intended for vision systems is described in Dcarholt, Gonzales, Ellington, and Phillips {9|, 
and in Dearholt, Gonzales, and Kirpekar (10). These database schemas also used Pathfinder networks. Motivations for 
the database described in the latter paper and extended in this paper include (1) increased efficiency in the search pro- 
cess by eliminating backtracking; (2) the efficient determination whether or not a given entity is represented in the data- 
base; (3) organization of the database so that similar entities are clustered together; (4) provision for category -level 
information by means of the clustering inherent in Pathfinder networks; and (5) support of description of the search and 
classification processes. 

The efficiency in the search process is accomplished at the time the network is generated by establishing links 
which provide a path between any two nodes so that the relative distance between nodes, as the path is traversed, is 
monotonically decreasing. Then, if a feature vector representing an external entity is presented to the database as a 
query, the corresponding node can be found rapidly from any node in the database. Heuristics to improve the initial 
node of the search can further improve the search efficiency. The determination that a given entity is not in the data- 
base follows from the procedure to be described in Section Hi. llie clustering of similar entities and the resultant 
category-level information is a feature of the PFNETs to be described in Section II. lhesc features provide a basis for 
the support of the description processes to he discussed in Section IV. 


tl. The Generation of a Pathfinder Network Database 

Because PFNETs provide for clustering of similar entities, they seem to be a good paradigm for a database organi- 
zation; indeed, their original purpose in modeling the semantic memory of humans provides for a database of concepts. 
Thus it seemed natural to extend PFNETs to feature-based applications in which each entity is desenbed by a feature 
vector. Vision systems used for pattern recognition and image analysis are well served by such databases, and the pro- 
perties of PFNETs support search, classification, and description, as mentioned previously. Our first effort in this direc- 
tion was the database for insect identification (Dearholt, et al., (9|), in which PFNETs were used to organize the data- 
base. PENETf-x, n-l) was used because it is the PFNET with fewest links, hut there was no very effective search pro- 
cedure associated with this PFNET organization. Our second effort (Dearholt, et al., j 1()|) justified and desenbed the 
construction ot the PFNETs which guarantee monotonic search. Ihe purposes ot this paper are to list the desirartle pro- 
perties of a vision database for robotics, and to describe the results of our work with Pathtinder networks relevant to 
these properties. It should thus be regarded as a progress report of our project, wntten tor the purposes ot the 
workshop. 
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Tbe database schema we will present relies on the Pathfinder network model as a means of organizing the entities 
in the database. Development of this mode! (previously called link-weighted networks) has been ongoing fix the past 
six yean. Pathfinder yields network structures (PFNETs) for a set of entities, given estimates or measures of the pair- 
wise distances between the entities. The original purpose of Pathfinder models was to model human semantic memory, 
so that the estimates of distances were typically estimates of similarity. For the database schema discussed here, how- 
ever, the entities are each represented by a feature vector, and distances between pairs of entities arc presented to tbe 
system as a weight matrix. If the weight matrix is symmetric, then the PFNETs derived from it are undirected, whereas 
an asymmetric weight matrix yields directed PFNETs. 

A maximally connected network contains a link between every pair of nodes, so that each weight in the weight 
matrix is represented by a link. Such a network contains all the original information in the data, but it provides very lit- 
tle information about the structure underlying the data. Pathfinder includes only the links necessary to preserve geo- 
detic paths, thus facilitating analysis and interpretation. T wo parameters are required for the complete definition of a 
PFNET for a particular weight matrix. These are the r-metric and the (/-parameter. The /--metric « the value of the 
Minkowski parameter which is used to compute the distance between nodes in the network which are not directly 
linked. That is, the weights along the path used to compute distance are individually taken to the r power, these values 
arc summed, and the rth root of the resulting sum is the distance. In general, 

dOv.jV/H t w k '\» 
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where the w k arc the weights along the path between N, and Nj. The r-metric parameter may take on values from 1 to 
oo. The (/-parameter determines the maximum number of links in paths considered to connect two nodes. For example, 
for q = 2, paths having more than two links are not considered in the preservation of minimum-distance (geodetic) 
paths in the PFNET. 

PFNETs possess properties of inclusion which vary as values of the r-metric and (/-parameter change (Dearholt, el 
al., [71). Briefly, for a particular weight matrix, PFNET? is a spanning subgraph of PFNET1 if and only if the r-mctric 
used for PFNET1 is less than or equal to the r-mctric used for PFNET2, provided that the (/-parameter is held constant. 
In addition, for a particular weight matrix, PFNET2 is a spanning subgraph of PFNEIT if and only if the value of the 
(/-parameter used for PFNEIT is less than or equal to the value of the (/-parameter used for PFNET2, provided that the 
r-mctric is held constant. The PFNET generated with r - ® and q = the-number-of-nodes-less-ooe always has the 
minimum number of links and is the union of all minimum cost spanning trees of an undirected PFNET'. 

As a PFNET is constructed, precedence is given to small weight values, because .they represent the strongest asso- 
ciations. During each stage of development of an undirected PFNEI\ the complete set of nixies is partitioned into con- 
nected subgraphs, called node sublists. When a link is added which joins nodes in different sublists, the two sublists arc 
merged to form a single node sublist. Links in an undirected PFNEI are labeled according to the basis for their inclu- 
sion in the PFNET. The four types of link labels arc PRIMARY, SECONDARY_A, SECONDARY_B, and I ER 1 1- 
ARY. A PRIMARY link provides the only path between a node sublist containing a single node and some other node 
sublist. A SECONDARY link joins two sublists which arc not connected, and in which there are either alternate paths 
to terminator nodes, or the node size of both node sublists exceeds one. SECONDARY„A links are in all minimum 
cost spanning trees. SECONDARY_B links are in only some minimum cost spanning trees, as they provide alternate 
paths of the same length between two nixies. A TERILARY link joins nodes within a single node sublist. TERTI- 
ARY links arc not in any minimum cost spanning tree. Ibe link-labeling rule yields important structural information, 
and the potential use of link labels in the descriptive processes will be discussed in Section IV. 

Investigation of transformations on the values of the weight matrix has yielded two results of importance relating 
to the structure of PFNETs. A multiplicative transformation applied to the elements of a weight matrix preserves link 
structure in the PFNET for any values of r or q. A monotonic transformation applied to data in a weight matrix 
preserves the structure of the PFNEI' only for r = «. 

'Hie construction of the database for vision applications presumes a set of entities and some procedure to derive 
feature vectors to represent these entities. Typically, tiie entities to be represented by nodes in the database are exam- 
ined to obtain salient features. Each class of feature values is presented as a vector; e.g., a color descriptor could 
include intensity values obtained from red. green, and blue filtered images. Similarly, shape descriptors might consist of 
a vector of Fourier coefficients. Difference measures for this paper are obtained using the L 1 norm (the computation is 
the same as the Minkowski distance for r = I) for each pair of entities, to obtain the weight matnx for input to Path- 
finder. 

T he Pathfinder model preserves all geodetic (minimum cost) paths having no more links than the value of the q 
parameter, and leads to clustering based upon similarity of nodes. Pairs of nodes which are not directly linked in a 
PFNEI' are likely to be in different categories or suheategories. PFNEls provide a means of sailing data similar in 
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some respects to clustering methods (e.g., Shepard and Arabie, [111) and to midtidimcnsiooal scaling (Kruskal [12]), 
but the finksin PFNETs provide information not directly available in clustering or in multidimensional sailing. Another 
network scaling scheme is NETSCAL (Hutchinson, [131), but unfortunately Hutchinsoo did not consider mangle ine- 
qualities of dimension greater than two. 

The domain assumed for the database consists of those problems in which each entity in the database is 
represented by a vector having d feature values, and corresponding features have their feature values in comsponAng 
locations in the feature vectors. In addition, we assume that the features of the entities are such that taking the differ- 
ence of corresponding feature values (as a part of applying the Ll norm) is appropriate. To begin the process of gen- 
erating a PFNET, it is necessary to compute a scalar weight matrix W For the purposes of this paper, we will compute 
the ’ralar weight values of W using the LI norm. For the Lm norm. 

»V=i £ Wr"j\ m \ Vm 
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The LI norm sums the magnitudes of the differences between corresponding components for the feature vectors 
being compared. If the L2 norm were used, it would be the Euclidean metric. For a discussion of distance measures 
suitable for data such as this, see Tversky and Krantz (14|. This article also justifies the Lm norms (the Minkowski 
metrics) as being the only metrics which possess both intradimensional subtractivity and mterdimensional addmvity. a 
feature that seems as important for vision databases as for cognitive modeling. 

Although the r-metric parameter used with PFNETS can vary from 1 through «. for the purposes of the databases 
described in this paper, oo will be used. The consequence of this is that the distance between nodes not directly linked is 
the value of the largest weight along the path connecting the nodes. This is sometimes called the dominant metric. Fbe 
databases in this paper use q = 2. The notation used for a PFNET in which feature vectors are used to compute W is 
PFNETf Lm , r, q), with the parameters in parentheses corresponding to the parameters discussed above. 

A primary purpose of the databases constructed as PFNETs is to support effective search, so that notation for 
search paths is helpful. Search paths begin at some initial node, follow links established m the construction of _ the 
PFNEF, and end at a node. Since there is at most one link between any two nodes, we will denote a search pat from 
node N, to node N k (passing through Nj as 

P(N„N 


,V. is said to be a predecessor of N t because it precedes N J in the search path. One way ot viewing the network is to 
think of the entities N, as points in d-dimensional space, establishing links according to the link membership rule of 
PFNEIs, and traversing these links according to the search procedure to be described. 

Another concept of importance is the lune of two nodes. The lune is discussed in loussaint ji5| in his definition 
and discussion of relative neighborhood graphs (RNGs). Lunes are also discussed in Lee |16j and in Katajainen and 
Nevalainen [I7j. The lune of two nodes (points) and will be denoted by lune(jV,, <V ; ), and is defined as the set of 
points in which each point has a distance (we’ll use the Ll norm, rather than the Ll norm as in the original work on 
RNGs) from both N, and JV less than the distance between N, and N r In the weight matrix W, these intemode dis- 
tances using the Ll norm are already computed. Using Ll, lune(N,, l V ; ) is a rectilinear figure of dimension d. If L2 
were used, lune(N,, N } ) would be the set of points in the intersection of two hyperspheres of dimension d. 

A notable difference between RNGs and PFNETs is in the assumptions usually made about the input spaces. For 
the RNG. two-dimensional space ts normally assumed for the input data, but no such constraint is necessary for 
PFNEIs If the ; nput spaces are two-dimensional, however, then the link membership of an RNG using L2 is such that 
the RNG is a spanning subgraph of PFNBI'(L2, r=2, q = 2). In the RNG, two nodes .V, and .V, are linked directly if 
and only if there is no other node in the lunef.V,, N ; ). Euclidean distance is used in the 2-spuce in which the entities are 
customarily represented as vectors for the RNG, so that the lunes are intersections of circles. 

Although PFNET(L2, r- 2, q^2) is satisfactory for a database organization, in terms of search for a matching 
entity in the database, the use of PFNEr(Ll, =. 2) is preferable because the latter requires less computation in both the 
construction of the network and in search. Using Ll, and assuming two dimensional input space, the link membership 
for an RNG is the same as the link membership of PFNET(L1, r-<*>. q = 2). For cither, the link membership rule is 

l if is in PFNEIXL 1 , 2) if and only if 

w v ^ min | max j ] 

over ail two-link palhs between .V, and ,V ; . 
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This definition can be viewed as providing a link membership rule for one of a family of PFNETs, or as an extension at 
the RNG to a new application making use of the Z. I metric. 

The reason this PFNET is efficient in searching for a matching entity in the database is that the link placement is 
such that backtracking is never needed, provided that there is a matching node in the database. 


III. Moootonic Search of a Pathfinder Database 

Efficient, monotonic search, in which there is always a link to direct the search path(s) toward a node matching 
the external entity to be identified, is one of the attributes of a database organized as a Pathfinder network. Justifica- 
tion of monotonic search of a PFNET(L 1 , <», 2) database follows. Consider that a set of entities N t has been established 
with their corresponding feature vectors, and that the scalar weight matrix IV has been computed using the Z.1 normr. 
Suppose that the PFNET(Ll, «, 2) has been constructed, and that E, is an external entity represented by a feature vec- 
tor compatible with the feature vectors of the N t in the database; it is desired to find the N k which provides the closest 
match to E t within the database. Further suppose that the initial node (the node where search is initiated) is chosen to 
be N it and that node N k in the database is a match for E t . 

That is, we assume for this discussion that the feature vectors for E x and N k are identical. Ihe goal is to find a 
path between and N k , applying the match criterion at each node along the search path, until it is determined that E M 
does indeed match N k . An appealing argument can be made through the use of the luncs defined by the nodes along 
the search path. Consider lune(^, N k ) --if there is no other node in this iune, then and N k are linked directly, and a 
one-link searcb path connects the initial node with the goal node. Alternatively, if there is another node in lune(/V f , 
N k ), then and N k are not directly linked. But each node in the lune(fy, N k ) is closer to N k than N, is to N k . so that in 
progressing to any node in the interior of the Iune, we diminish the distance to N k . The node N J closest to N, will be 
linked to N it and the search path can proceed to Nj. The search path can, however, proceed to any node in lune(.V ; , N k ) 
which is linked to N t , and it is usually advantageous to go to the node which diminishes the distance to the goal node the 
most. Suppose the search progresses to node N r Here, the process is repeated with lune(W ; , N k ), and every node in this 
Iune is closer to N k than is N ] , so again the distance to the goal diminishes. In this fashion, the goal node is reached 
using only distance measurements between E t and the nodes which are candidate successors for nodes on the searcb 
path, since we assume that E t and N k have the same feature vectors. That is, at node N Jt the difference between E x and 
nodes linked to A/ y is taken, and the difference which is smallest determines the next node in the search path. Thus the 
link structure guarantees that no backtracking is ever necessary if the entity E, is in the database. If the distance from 
some node in the search path to E t does not reach zero and cannot be diminished, then this indicates that there is no 
node which exactly matches E t in the database. 

A matching criterion based upon network properties is under development, although some aspects of a match cri- 
terion arc necessarily dependent upon the problem domain. Throughout this paper, we presume that the matching cri- 
terion requires the goal node and E t to match much more closely than E„ matches any other node in the database- 
Refinement of the theory of matching criteria is an area we are continuing to investigate. 

There are four aspects of the search process for a database as described a.x>ve, although the fourth is not always 
needed. These are: 


(1) The selection of an initial node from which to begin the search, 
usually by means of some heuristic 

(2) Ihe selection of a path from the initial node to the best matching 
node in the database. 

(3) lhe application of the match criterion to each mxle along the path 
through the database, to determine whether any node on the path is 
a satisfactory match for the feature vector representing the entity 

to be identified. 

(4) Ihe determination of nearest neighbors of the node most nearly matching 
the external entity. Pathfinder networks support this search for 

nearest neighbors, because the link structure preserves geodetic 
(minimum) distances throughout the network. 
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The ickctioa of an initial node can be accomplished by means of an index on some of the most salient features, so 
that, most of the time, the initial node is in the same class as the entity to be identified. Search efficiency is enhanced, 
of course, if the search is begun in the proper category. But the property of PFNET(Ll,°°,2) of guaranteeing that from 
each node, the distance to every other node in the database can be diminished by traversing some link assures that the 
choice of the initial node does not affect the convergence to the goal node if the tatter is in the database. This is impor- 
tant because it is not alway passible to begin the search at a node in the same category as the goal node. 

The search procedure consists of the following steps, to be done at each node in the search path, from the initial 
node to the node at which the decision regarding a match can be made: 

(1) The match criterion is applied to the present node (starting with 
the initial node) to determine whether or not the present node is 
a satisfactory match with the entity £,. If the match is 
satisfactory, then bait. 

(2) The distance d(N it E t ) between each node N, adjacent (linked) 
to the present node and the entity E M is computed using the L\ 
metric. 

(3) The node which decreases the distance d(N t ,E t ) the mast is 
selected as the next node in the search path, if it is not possible 
to decrease the distance to the goal node (represented by E x ) 
then there is no matching node in the database. Otherwise, 
return to step one. 


As an example, consider the set of nodes 


A'l = (2. 1) N$ - (9, 4) 

JV 2 -(4.1) A/ 6 = (9, 6) 

N . 3 - (5. 4) N 7 = (7, 8) 

N 4 - (3. 5) = (10, 8) 

The weight matrix for this set of feature vectors, computed using the L 1 norm, is 


W = 


0 2 
0 


6 5 
4 5 
0 3 
0 


10 12 
8 10 
4 6 

7 7 

0 2 
0 


12 15 
10 13 

6 9 

7 10 

6 5 

4 3 

0 3 

0 


The PFNET(ZJ,°°,2) is constructed using IV, and is shown in Figure 1. 


N 7 N s 



Figure 1 
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If the search is started at » (2, 1), with E t - (10, 8) « N t , then the search path is ?iN x , N+, N if Afj), and 
the search procedure baits at N t with a match. Links are followed at each step, and the distance to the goal node 
decreases monotooicaliy at each node. 

The match criterion necessarily has some aspects which are domain dependent, but ooutd be as simple as requiring 
tha t the distance between the goal node (providing the presumed best match) and E x be less than some threshold value 
computed from the smallest distances between nodes in the database. Refinements could indude the addition of an ele- 
ment of context in the sense that each category may have somewhat differing variability associated with satisfac- tory 
matching. The search process outlined above does not guarantee that the path with fewest links will be found, but it 
does g uaran tee that a path to a matching node will be fouod in which the distance decreases monotooicaliy along the 
search path. If there is no node in the database which is an exact match to this is determined when the di st a n ce 
from a node Nj (on the search path) to E, is larger than the distance from the preceding node in the search path to E x . 
In this case, using the II norm, then it is not possible to get a better match to E t than the predecessor of Nf, if that is 
not a satisfactory match, then E x is considered not to be in the database. Formal proof of this is forthcoming in 
Dearholt (18|. 


IV. Description of Decisions and Neighborhoods 


For an intelligent system, it is desirable to have support for the description of decisions made during the search 
process. This information can be very useful for communicating with the system in an attempt to understand not only 
the classification decision for a particular entity, but the properties of the neighborhood surrounding the entity in the 
network. The latter information can, of course, be used in some of the more sophisticated classification algorithms. 
Because of the clustering properties of PFNETs, and the directness of the search process associated with PFNET(f.l. 00 . 
2), there is substantial information available regarding the classification results. The categories of nodes along the 
search path, and values of some of their most salient features, are the principal pieces of information used for our 
descriptive processes. We will focus mainly on four issues: 

(1) The node where search is initiated, 

(2) The search path, 

(3) Link labels, 

(4) The classification decision. 

The selection of the initial node for the search procedure is a very significant decision. Although the PFNET(L1 , 
oo, 2) guarantees convergence between any pair of nodes, the search time can be lessened substantially in a large data- 
base by judicious selection of the initial node. Beginning at a node which is in the same cluster as the goal node is a 
desirable objective; but the solution of this problem would imply that the classification problem for the entities in the 
domain is also solved. For many domains, the selection of a few key features which often lead to correct classification 
can be used to provide a sort of indexing into the network, so that the initial node could be the node having highest 
degice in the category indicated by the feature values in the set of key features. These key feature values, and the 
heuristic selection of an initial node based on them, are thus a pan of the descriptive process at the beginning of the 
search. 

As the search begins from the initial node (selected by some heuristic), at each node N, in the search path some 
decision is made based upon the distance between the external entity and the nodes linked to A/ i . The most suitable 
strategy, as discussed in Section III, is to select the node which most deer lses the distance to the goal node, although 
there is no guarantee that the goal nixie will be reached in the fewest steps by using this strategy. The values of the 
feature vectors of the nodes which are candidate successors to N, are available, and for the Nj which is the successor 
node to N,, the feature value(s) which are most responsible for diminishing the distance to the goal are available for use 
in description. Together, these feature values arc an indication of progress made toward the goal entity, since back- 
tracking is never necessary. Furthermore, if there arc multiple search paths (previously defined as the paths leading to 
the gpal nixie), properties of all of the search paths can provide important information to the descriptive process. 

The link labels on the links traversed also have some significance, as pointed out in Section II. PRIMARY and 
SECONDARY_B links arc typically indicative that the search is progressing within a category, while the traversal of a 
TERTIARY link in a PFNEI'd 1, =>c. 2) seems to indicate that the search path has progressed to a new category or sub- 
category. The traversal of a SECONDARY, A link also usually indicates entry into a new category. Pnxifs of these 


ob servations arc difficult aod not yet available, partly be c au se the definition of “category" or "subcategory is difficult. 
Wc are continuing to investigate the information provided by link labels, however, and that information is available for 
description also. 

The last part of the search involves classification of the external entity, if that is passible. The choices of nodes 
available at the last step of the search provide information regarding the dass and the centrality of the entity, so that 
some level of confidence in the decision of category could be assigned. That is, if tbe match with the goal node were 
borderline, and the node were near another category, then the confidence in the classification should not be very peat. 
But if the goal node matrhwl quite well and were also in the "center" of a category, then the confidence of the classifica- 
tion should be high. The Pathfinder paradigm supports a local search for nearest neighbors, so that this information can 
be used in either the classification decision or in the description of the neighborhood surrounding the goal node. The 
search for neighboring entities can be viewed as search by spreading activation, which would leave the god oode and 
travel to the neighboring nodes, so that their feature values and classification is available. Thus tbe centrality of the 
goal oode, and its relationships to other nodes and categories, can be readily determined. 


V. Summary and Conclusions 

The construction of a database for vision applications using Pathfinder networks (PFNEI's) was described, and it 
was shown that the search procedure associated with this database organization is monotonic (a distance measure 
steadily decr eases at each node throughout the search path), provided there is a node in the database which matches the 
external entity. Relationships with the relative neighborhood graph (RNG) were discussed, and the search procedure 
was described. The database organization and search procedure provide a basis for descriptive processes of the decisions 
made along the entire search path, from tbe initial node to the goal node (or to the point where it is decided that there 
is no match in the database). Description based on these feature values and changes in feature values along the search 
puth(s) and in the neighborhood of the goal node is expected to be useful in enhancing communication between the 
vision or robotics system and humans working with the system. 

Research is continuing on some of the open questions encountered thus far. rhe match criterion used to deter- 
mine whether the external entity matches some node in the search path “satisfactorily" is an important problem, and it 
has some domain-dependent properties and some characteristics which can be determined from a graph- theoretic per- 
spective. The precise role of the PRIMARY, SECONDARY, and TERTIARY links in PFNET(L! . *, 2) is also being 
studied, particularly how these link labels relate to category structure of tbe network. ITic descriptive processes arc 
under investigation from tbe perspective of graph theory, although there seems to be a domain-dependent aspect also. 
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